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VALIDATION  OF  NON  VERBAL  MEASURES  FOR  SELECTION 
AND  CLASSIFICATION  OF  ENLISTED  PERSONNEL 


I.  INTRODUCTION 

The  selection  and  classification  of  Air  Force  personnel  have  been  dependent  upon  traditional 
paper-and -pencil  tests  for  many  years.  Based  on  the  enlistees’  test  results,  interests,  and  Air  Force  needs  at 
the  time,  an  assignment  to  a particular  career  area  or  job  is  made  depending  on  the  prerequisite 
qualifications  for  such  assignment.  For  some  time,  there  has  been  a growing  concern  as  to  whether  these 
conventional  aptitude  measures  and  the  scores  derived  from  these  tests  accurately  reflect  an  individual's 
capability  or  potential.  Such  aptitude  tests  measure  a limited  portion  of  an  individual’s  mental  abilities, 
namely  those  implicit  in  verbal  skills,  and  have  ignored  other  types  of  abilities  and  problem  solving  skills.  In 
addition,  numerous  factors  have  been  identified  as  having  an  adverse  effect  on  scores  from  traditional  tests: 
lack  of  motivation,  poor  educational  opportunities,  limited  exposure  to  various  news  and  information 
media,  reading  disability,  and  inadequacies  in  the  family’s  educational,  cultural,  social,  and  economic 
background  (Anastasi,  1968;  Freeberg,  1970;  Haggard,  1952). 

As  early  as  1941,  non-verbal  aptitude  measures  were  being  developed  in  an  effort  to  provide  tests 
suitable  for  individuals  with  limited  reading  skills  and/or  lower  levels  of  intellectual  ability  (Cattell.  1950: 
Pintner,  1945;  Porteus,  1950;  Raven,  1941).  Such  tests  developed  to  meet  this  include  Raven’s  “Progressive 
Matrices,”  1941;  Cat  fell’s  “Culture-Fair  Intelligence  Test,”  1940;  “Tire  Davis-Eells  Game  Test."  1953; 
“The  Leiter  Performance  Scale,”  1940;  and  the  “Goodenough-Harris  Drawing  Test,”  1963. 

In  the  military  setting,  one  of  the  initial  efforts  to  test  illiterate  or  disadvantaged  personnel  resulted  in 
development  of  the  “Army  Beta”  during  World  War  1 (Yerkes,  1921).  Other  governmental  agencies,  both 
federal  and  state,  have  attempted  to  develop  non-verbal  test  measures  in  lieu  of  more  verba)  measures.  One 
example  is  the  development  of  the  non-reading  edition  of  the  General  Aptitude  Test  Battery  (GATB) 
developed  by  the  U.S.  Employment  Service.  This  version  of  the  GATB  is  used  to  measure  the  aptitude  of 
job  applicants  and  candidates  for  occupational  training  who  have  previously  had  a meager  or  unsuccessful 
experience  with  tests  of  a more  verbal  nature  (U.S.  Department  of  Labor,  1968). 

In  general,  non-verbal  tests  have  not  been  found  to  correlate  well  with  verbal  measures  of  intelligence 
or  with  academic-type  criteria  (Rulon,  1950).  However,  in  an  Army  study,  Maier  (1971)  found  that 
although  non-verbal  tests  did  not  form  an  independent  and  separate  cluster  from  verbal  tests,  the 
relationship  between  verbal  and  non-verbal  tests  appeared  to  be  dependent  upon  the  specific  test  content  of 
the  non-verbal  measures.  For  example,  some  non-verbal  tests  such  as  shop  mechanics  were  closely  related  to 
mechanical-type  tests  with  little  verbal  content;  on  the  other  hand,  radio  code,  clerical  speed,  and  pattern 
analysis  were  found  to  be  more  closely  related  to  academic-type  tests.  Regardless  of  the  relationship 
reported  between  verbal  and  non-verbal  tests,  it  is  generally  recognized  that  non-verbal  measures  are 
extremely  valuable  in  providing  additional  unique  information  concerning  an  individual’s  capabilities 
(Anastasi,  1968).  In  an  Air  Force  study,  Wilboum  (1973)  found  that  non-verbal  tests,  when  added  to  (he 
selector  aptitude  indexes  (Mechanical,  Administrative,  General,  Electronics),  added  a significant  and  unique 
contribution  to  the  prediction  of  technical  training  school  success.  He  concluded  that  the  use  of  non-verbal 
tests,  as  well  as  other  aptitudinal  and  educational  data,  could  make  a significant  contribution  if  added  to 
the  Air  Force  operational  selection  and  classification  battery. 

The  problem  of  economic  utilization  of  manpower  has  become  increasingly  important  with  the 
all-volunteer  force  structure.  The  objective  of  the  Air  Force  classification  and  assignment  system  is  to 
optimize  the  assignment  of  personnel  to  particular  career  areas  where  their  talents  and  abilities  match  those 
required  by  the  job.  This  objective  is  no  less  important  for  those  who  demonstrate  lower  levels  of  ability  as 
measured  by  traditional  paper-and-pencil  tests.  In  an  effort  to  estimate  the  abilities  of  lower  level  personnel 
more  accurately,  this  study  relates  performance  on  a non-verbal  test  to  performance  in  technical  training. 
The  usefulness  of  these  tests  along  with  additional  aptitudinal  and  educational  data  was  also  evaluated  in  an 
effort  to  refine  and  improve  the  selection  and  classification  procedures  currently  used  by  the  Air  Force. 


IL  METHOD 


The  sample  population  consisted  of  13,584  non-prior  service  male  basic  airmen  tested  during  the 
period  3 August  73  through  15  August  75.  The  sample  was  divided  into  34  subgroups  based  on  the  initial 
technical  training  course  to  which  they  were  assigned.  The  technical  training  courses  used  for  validation 
analyses  and  the  number  of  subjects  in  each  career  area  are  listed  in  Table  1.  Since  one  of  the  objectives  of 
this  study  was  to  assess  the  usefulness  of  non-verbal  tests  among  lower  ability  personnel,  specific  analyses 
were  included  for  the  lower  mental  ability  categories.  Lower  mental  ability  personnel  include  those 
individuals  who  are  classified  as  Category  111  and  IV  based  on  their  percentile  scores  of  10  through  64  on 


Table  1.  Sample  Population  by  Air  Force  Specialty  Code 

Sal 

Total 

Cat  III 

AFSC 

Title 

Al 

N* 

I.  IV  N» 

25231 

Weather  Observer 

G-80 

145 

27230 

Air  Traffic  Control  Operator 

A/G-60 

390 

173 

29130 

Telecommunications  Operations  Specialist 

A/G-60 

143 

30332 

AC&W  Radar  Repairman 

E-80 

99 

30430 

Radio  Relay  Equipment  Repairman 

E-80 

138 

30434 

Ground  Radio  Communications  Equipment  Repmn 

E-80 

275 

32231 

Weapon  Control  Systems  Mechanic 

E-80 

145 

32830 

Avionics  Communications  Specialist 

E-80 

100 

32833 

Electronic  Warfare  System  Specialist 

E-80 

126 

32834 

Avionic  Inertial  and  Radar  Navigation  Sys  Spl 

E-80 

99 

42132 

Acft  Pneudraulic  Repairman 

M/E-40 

132 

98 

42133 

Aerospace  Ground  Equipment  Repairman 

M/E40 

450 

365 

43131 

Aircraft  Maintenance  Specialist 

M/E-50 

2,295 

1,499 

43230 

Jet  Engine  Mechanic 

M40 

742 

503 

44330 

Missile  Mechanic 

M-50 

162 

114 

46130 

Munitions  Maintenance  Specialist 

M/E-60 

569 

368 

46230 

Weapons  Mechanic 

M/E-60 

610 

389 

53430 

Airframe  Rcapir  Specialist 

M-40 

196 

128 

54330 

Electrical  Power  Production  Specialist 

M/E-50 

214 

160 

57130 

Fire  Protection  Specialist 

G40 

388 

265 

60530 

Air  Passenger  Specialist 

A-50 

121 

60531 

Air  Cargo  Specialist 

M-50 

350 

233 

62230 

Food  Service-Cook 

G-40 

112 

63130 

Fuel  Specialist 

G/M-40 

305 

241 

64530 

Inventory  Management  Specialist 

A/G-60 

669 

409 

64730 

Materiel  Facilities  Specialist 

G-60 

229  . 

184 

70230 

Administrative  Specialist 

A-40 

724 

579 

73230 

Personnel  Specialist 

A-60 

213 

137 

81 130 

Security  Specialist 

G-40 

1,996 

1,428 

81230 

Law  Enforcement  Specialist 

G-50 

601 

335 

90010 

Medical  Helper 

G-60 

352 

178 

90230 

Medical  Service  Specialist 

G-60 

267 

123 

90630 

Medical  Administrative  Specialist 

G-60 

116 

92230 

Aircrew  Life  Support  Specialist 

G-40 

111 

a Analyses  on  those  AFSC  samples  of  less  than  150  were  not  cross-validated. 


analyses. 


Those  AFSCs  with  blanks  inserted  did  not  contain  a sufficient  number  of  subjects  for  separate  Category  I1I/IV 
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the  Aimed  Forces  Qualification  Test  (AFQT).  Of  the  34  career  areas,  21  had  a sufficient  number  of 
Category  DI  and  IV  personnel  to  allow  separate  analyses  for  these  subgroups  alone.  When  sufficient 
numbers  of  enlistees  (N  = 150  or  more)  were  available,  the  technical  course  subgroups  were  further  divided 
into  random  halves  for  validation  and  cross-validation  analyses. 

All  subjects  were  administered  the  Non-Verbal  Aptitude  Battery  - Revised  (NVAB-R)  which  consists 
of  eight  subtests  described  in  Table  2.  In  addition,  aptitudinal  information  (Armed  Services  Vocational 
Aptitude  Battery  — Form  3 [ASVAB-3]  aptitude  index  scores,  and  AFQT  percentile  scores)  was  retrieved 
from  airman  record  files  maintained  by  the  Personnel  Research  Division  of  the  Air  Force  Human  Resources 
Laboratory  (AFHRL).  Table  3 lists  the  subtest  components  for  the  four  Selector  Aptitude  Indexes  (Al)  of 
the  ASVAB-3.  Criterion  data  (final  technical  school  grades)  were  obtained  from  the  tape  files  of  the 
Computational  Sciences  Division  of  AFHRL. 


Table  2.  Description  of  the  Non-Verbal  Aptitude  Battery  - Revised 


Title 

Description 

Number 
of  Items 

Time 

Limit  (min)* 

(1)  Number  Reversal 

A measure  of  perceptual  speed  and 
accuracy  in  finding  the  exact  reversal 
of  a series  of  1 to  7 digits 

48 

7 

(2)  Pattern  Matching 

A measure  of  abstract  reasoning 
abilities  comprised  of  pictorial 
problems  which  require  the  subject 
to  select  the  part  that  completes  a 
specified  pattern  among  five  alternatives. 

38 

20 

(3)  Dial  Reading 

A measure  of  numerical  ability  which 
requires  the  subject  to  read  a dial 
quickly  and  accurately. 

30 

4 

(4)  Paired  Letters 

A measure  of  perceptual  speed  and 
clerical  ability  in  finding  a pair 
of  letters  or  figures  identical  to 
the  underlined  pair  in  each  item. 

34 

3 

(5)  Wheels. 

A measure  of  mechanical  reasoning 
in  which  the  subject  determines  the 
turning  direction  of  a series  of  wheels 
when  the  direction  of  one  wheel  in  the 
series  is  given. 

60 

10 

(6)  Figure  Analogies 

A measure  of  abstract  reasoning  ability 
in  which  the  subject  is  required  to 
determine  how  figures  can  be  alike. 

30 

10 

(7)  Card  Patterns 

A reasoning  test  comprised  of  pictorial 
problems  using  playing  card  suits  arranged 
in  patterns  and  series. 

50 

20 

(8)  Dominoes 

A reasoning  test  comprised  of 
pictorial  problems  of  dominoes  arranged 
in  numeric  patterns  and  series. 

88 

25 

*Doe«  not  include  instructions  and  sample  items. 
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Table  3.  Subtests  and  Aptitude  Composites  Comprising  the  Armed 
Services  Vocational  Aptitude  Battery  — Form  3* 


Subtest 

Number 
of  Items 

Subtest  Composite  (or  Aptitude  Index 

Mechanical 

Administrative 

General 

Electro  nice 

Coding  Speed 

100 

X 

Word  Knowledge 

25' 

X 

X 

Arithmetic  Reasoning 

25 

X 

X 

Tool  Knowledge 

25 

X 

- 

Space  Perception 

25 

X 

Mechanical  Comprehension 

25 

X 

Shop  Information 

25 

X 

Automobile  Information 

25 

X 

Electronics  Information 

25 

* 

X 

aEffcctivc  July  1973  through  September  1975.  < 

Multiple  linear  regression  analyses  (Bottenberg  & Ward,  1963)  were  used  to  determine  the  usefulness 
of  the  non-verbal  tests,  educational  and  aptitudinal  data,  both  with  and  without  the  Selector  AI  in 
predicting  final  school  grade  (FSG).  Table  4 describes  the  variables  used  in  these  analyses. 

Table  4.  List  of  Variables 


Variables  Description 


Predictor 


AFQT  score 
Aptitude  indexes 
Educational  level 
Non-verbal  tests 

Selector  Aptitude  Index  (AO 


Continuous  variable  based  on  percentile  score  obtained  on  Armed  Forces 
Qualifying  Test  (AFQT). 

Continuous  variables  based  on  percentile  score  obtained  on  Armed  Services 
Vocational  Aptitude  Battery  - Form  3 (ASVAB-3). 

Categorical  variable  with  three  categories:  high  school  non-graduate,  high 
school  graduate,  attended/graduated  from  college. 

Continuous  variables  based  on  raw  scores  obtained  on  each  of  the  following 
non-verbal  tests:  number  reversal,  pattern  matching,  dial  reading,  paired 
letters,  wheels,  figure  analogies,  card  patterns,  and  dominoes. 

Continuous  variable  based  on  percentile  score  on  aptitude  index  used  as 
prerequisite  for  entry  into  a particular  career  field. 


Criterion 

Final  school  grade  Numeric  grade  assigned  upon  completion  of  technical  training  course. 


IIL  RESULTS  AND  DISCUSSION 


Validity  of  the  Non-Verbal  Tests 

Table  5 presents  the  restricted  correlations  for  the  validation  subgroups  between  each  of  the 
non-verbal  tests  and  FSG  criterion.  To  facilitate  comparisons  among  the  Air  Force  specialty  codes  (AFSCs), 
the  training  courses  were  categorized  into  six  broad  technical  areas  based  on  similarity  of  job  duties  and/or 
Selector  AI  level.  The  relationship  between  the  eight  subtests  and  the  criterion  varies  widely  among  the 
technical  areas.  In  12  of  the  34  courses,  all  eight  subtests  showed  a statistically  significant  relationship.  In 
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an  additional  11  areas,  a majority  of  the  subtests  (i.e.,  four  to  seven)  also  demonst.ated  significant 
relationships.  On  the  other  hand,  in  Personnel  (73230),  Food  Service— Cook  (62230),  and  two  of  the 
Electronics  courses  (30332,  30430),  none  of  the  non-verbal  tests  were  found  to  be  significantly  related  to 
the  criterion.  In  general,  it  appears  that  a larger  number  of  the  subtests  show  significant  relationships  in  the 
Mechanical  specialties.  In  the  Electronics  area,  the  opposite  trend  is  prevalent.  No  obvious  trend  appears 
between  the  Selector  A1  level  required  for  course  entry  and  the  significance  of  the  non-verbal/criterion 
relationships.  Although  no  significant  relationships  were  found  in  five  of  the  E-80  Selector  A1  courses, 
significant  correlations  were  noted  in  two  other  E-80  Electronics  courses  and  the  G-80  Weather  course. 
Among  the  individual  subtests,  figure  analogies  and  dial  reading  demonstrated  substantial  relationships  in  a 
majority  of  the  technical  training  areas,  while  card  patterns  had  the  lowest  number  of  statistically 
significant  relationships  across  all  the  technical  areas. 

Results  of  the  correlational  analysis  for  lower  mental  ability  personnel  are  presented  in  Table  6. 
Thirteen  of  the  technical  courses  had  an  insufficient  number  of  individuals  to  permit  separate  analyses  of 
the  low  ability  subgroup.  In  addition  to  the  high  Selector  A1  level  courses  (i.e.,  A1  level  = 80)  where  few,  if 
any,  Category  111  and  IV  personnel  would  qualify,  five  other  AFSCs  had  to  be  deleted  from  these  analyses. 
In  the  smaller  subgroups  containing  only  the  lower  categories,  significant  correlations  between  the 
non-verbal  subtests  and  the  criterion  were  less  prevalent.  In  interpreting  the  results  obtained  in  the  lower 
ability  level  subgroups,  it  should  also  be  realized  that  the  small  number  of  cases  in  several  of  the  courses 
tend  to  make  the  relationships  found  somewhat  unreliable.  In  only  two  of  the  technical  training  areas 
(Aerospace  Ground  Equipment  Repair  [42133]  and  Security  Police  [81130])  were  the  relationships 
between  the  non-verbal  subtests  and  criterion  found  to  be  statistically  significant  for  each  of  the  subtests. 
Similar  to  the  all-category  subgroups,  the  two  non-verbal  subtests  found  to  have  the  largest  number  of 
significant  relationships  across  the  technical  training  areas  were  dial  reading  and  figure  analogies.  Those 
subtests  showing  the  least  relationship  to  the  criterion  were  card  patterns  and  number  reversal.  Although 
the  correlations  found  among  the  lower  category  personnel  appear  somewhat  attenuated,  the  validities  of 
one  or  more  of  the  non-verbal  tests  are  of  sufficient  size  to  emphasize  their  potential  usefulness  in  selection 
and  classification  instruments,  particularly  with  the  lower  ability  levels  of  enlisted  accessions. 

To  determine  the  usefulness  of  the  entire  non-verbal  battery,  multiple  correlations  were  computed  for 
the  non-verbal  composite  comprised  of  all  eight  subtests.  These  correlations  are  contained  in  Table  7 for 
both  the  all-category  and  lower  category  subgroups.  For  the  all-category  samples,  the  non-verbal  composite 
was  significantly  related  to  FSG  in  27  of  the  34  technical  training  areas.  There  was  no  trend  or  similarity 
noted  among  the  courses  in  which  the  non-verbal  composite  failed  to  reach  the  required  significance  level. 

For  the  lower  category  personnel,  the  non-verbal  composite  was  statistically  significant  in  16  of  the 
21  courses.  Three  of  the  courses  in  which  non-significant  correlations  were  noted  were  identical  to  the 
courses  where  non-significant  correlations  were  found  in  the  all-category  samples  (Missile  Mechanic 
[44330] , Personnel  [73230] , and  Electrical  Power  Production  [54330]). 


To  determine  the  stability  of  the  non-verbal  composites  from  one  sample  to  another,  a 
cross-validation  procedure  was  utilized.  Where  sample  size  was  sufficient,  the  population  was  randomly 
divided  into  two  half-samples.  Where  the  validation  composites  were  significant,  regression  weights 
developed  on  half-sample  1 (validation  sample)  were  then  cross-applied  to  half  sample  2 (cross-validation 
sample).  The  results  are  shown  in  Table  7,  Column  A/B,  All  Non-Verbal.  Some  shrinkage  in  the  magnitude 
of  the  correlation  coefficients  was  evidenced.  Of  the  21  courses  permitting  cross-application  in  the. 
all-category  samples,  18  maintained  a level  of  significance.  In  the  lower  ability  subgroups,  three  of  the 
courses  in  which  significant  correlations  were  noted  in  the  validation  sample  demonstrated  non -significant 
relationships  upon  cross-validation.  These  somewhat  spurious  results  found  in  the  Category  III  and  IV 
subgroups  may  be  a result  of  the  small  sample  size.  Although  the  cross-validation  results  make  the  use  of 
non-verbal  measures  doubtful  in  some  areas,  the  fact  that  the  non-verbal  composite  did  reach  and  maintain 
a level  of  significance  in  both  validation  and  cross-validation  samples  in  a sizable  number  of  courses 
indicates  their  potential  usefulness  as  possible  subtests  in  future  revisions  of  operational  tests. 
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Table  6.  Correlations  Between  Non-Verbal  Tests  and  Final  Technical  School  Grade 
Validation  Sample  - Category  111  - IV 


hihli  Multiple  C orrelations  of  Non-Verbal  Composite 
and  C ross-Validation  Results 


All  Categories 

Categories  III  & IV  Only 

Area 

AFSC 

N 

All  NVC 

Aa  A/B*5 

N 

All  NVC 

A>  A/8b 

hie  cl  ronies 

30332 

66d 

.25 

_ 

x« 

_ 

30430 

1 38d 

.26 

— 

X 

— 

_ 

30434 

137 

.37** 

.19* 

X 

_ 



32231 

145d 

.37** 

— 

X 

— 

_ 

32830 

100li 

.38* 

— 

X 

— 



32833 

126d 

.53** 

— 

X 

— 

_ 

32834 

66d 

.51** 

- 

X 

- 

- 

Mechanical 

42132 

132d 

.46** 

_ 

98d 

.48** 



42133 

225 

.41** 

.41** 

182 

.45** 

.29* 

43131 

1,147 

29** 

.28** 

749 

.22** 

.19** 

43230 

371 

.37**  ' 

.43** 

251 

.26* 

.29** 

44330 

81 

.36 

- 

ll4d 

.34 

— 

46130 

284 

.36** 

.15* 

.193 

.30* 

.04 

46230 

305 

.23* 

.18** 

194 

.19 

— 

53430 

68 

49** 

29** 

128d 

.38** 

_ 

54330 

107 

.29 

- 

80 

.20 

- 

Administrative 

60530 

121 d 

.36* 

_ 

X 

_ 

_ 

60531 

175 

44** 

.37** 

116 

.36* 

.27** 

64530 

334 

.31** 

2i** 

204 

.29* 

.03 

70230 

362 

.38** 

.23** 

289 

.24* 

.23** 

73230 

106 

.33 

- 

137d 

.27 

- 

Medieal/Dental 

90010 

176 

.52** 

.43** 

89 

.50** 

.30** 

60230 

133 

.55** 

.42** 

123d 

.37* 

_ 

60630 

1 16d 

.37* 

- 

X 

- 

- 

Serviees/Supply 

57130 

164 

.28* 

.24** 

132 

38** 

.18* 

62230 

1 12d 

.32 

— 

X 

— 

— 

63130 

152 

.59** 

.44** 

120 

.44** 

.53** 

64730 

1 14 

.38* 

.44** 

92 

.49** 

.17 

81  130 

666 

.38** 

.32** 

714 

.27** 

.34** 

81230 

300 

42** 

.31** 

167 

.35** 

.23** 

62230 

IIId 

.29 

- 

X 

- 

- 

Communications/Operations 

25231 

145d 

.43** 

_ 

X 

_ 



27230 

195 

.41** 

.31** 

86 

.36 

— 

26130 

143d 

.46** 

— 

X 

— 

- 

a“A”  corresponds  to  the  validation  sample. 

“A/B”  corresponds  to  the  cross-application  of  regression  weights, 
includes  all  eight  non-verbal  subtests. 

^Insufficient  N for  cross  application  of  regression  weights,  total  sample  used  where  applicable. 

Insufficient  N for  Category  III  and  IV  analysts. 

^Significant  at  or  beyond  .05  level. 

* * Significant  at  or  beyond  .01  level. 
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I’rcdiitiu'  l t il it v of  Sun  \i  il>;il  Haller)  Versus  Selector  \l 

Beloie  iccommeiklmg  the  possible  inclusion  ul  non-verbal  1 os  I s in  an  «>|>«.*i;iiu>iial  l>at  tci  n . n is 
ik'a.ss.ii>  In  deleimilic  wlietliei  the  pi  o posed  Icsls  add  significantly  In  I lie  predict  inn  nl  pcrfoi  malice 
criteria  nvci  ami  abuse  the  aptitude  measures  cuirently  in  use.  To  do  ibis,  a series  nl  regression  analyses 
were  accomplished  in  which  the  unir|ue  and  valid  contribution  of  the  non-verbal  t ests  over  and  above  the 
Sclectm  Al  was  assessed.  Results  ol  these  analyses  are  presented  in  Table  8,  Regression  Analysis  1 . In  2 1 of 
the  24  career  specialties,  il  was  found  in  the  all-category  samples  that  the  non-verbal  tests  added 
significantly  to  piediction  of  training  performance  over  the  efficiency  of  the  Selector  Al  used  alone. 

l or  the  Category  111  and  !V  samples,  as  seen  in  Table  8,  the  utility  of  the  non-verbal  battery  over  and 
above  the  Selector  Al  was  demonstrated  in  14  of  the  21  career  areas.  The  results  in  both  the  all-category 
and  lower  category  subgroups  appear  to  indicate  that  the  predictive  efficiency  of  the  non-verbal  battery  is 
not  confined  to  any  specific  group  of  specialties  but  is  generally  applicable  to  a majority  of  career  areas. 
The  widespread  utility  of  the  non-verbal  composite  in  a variety  of  specialty  areas  further  substantiates  their 
potential  usefulness  in  an  operational  test  battery. 

The  stability  of  the  relationships  between  the  technical  training  criterion  and  the  non-verbal/Selector 
Al  composites  was  also  determined  by  cross-validation.  The  correlation  coefficients  obtained  from 
cross-application  of  regression  weights  to  half-sample  2 are  contained  in  Table  9.  In  the  all-category  group, 
composites  were  significantly  related  to  the  criterion  in  29  of  the  34  courses.  Where  cross-application  was 
possible,  composites  maintained  their  significance  in  all  instances.  For  the  lower  category  subgroups.  15  of 
the  21  technical  areas  had  significant  composites  in  the  validation  sample.  In  the  12  courses  where 
cross-application  was  applied,  all  composites  except  Materiel  Facilities  Specialist  (64730)  remained 
significant. 

Usefulness  of  Additional  Aptitudinal/Educational 
Data  in  Predicting  Training  Performance 

In  previous  research,  results  concerning  the  utility  of  using  additional  aptitudinal  and  educational 
data  in  the  selection  and  classification  process  have  varied  (Brokaw,  1963;  Judy,  1960,  1965;  Lecznar, 
1964).  However,  findings  have  generally  indicated  that  data  such  as  educational  level  and  aptitude  scores 
other  than  the  Selector  Al  demonstrated  a significant  relationship  with  performance  criteria,  especially  for 
lower  mental  ability  individuals.  In  a recent  Air  Force  study,  Wilbourn  (1973)  found  that  additional 
aptitudinal  and  educational  data  did  add  significantly  to  the  operational  Als  in  many  instances. 

To  assess  the  predictive  efficiency  of  such  data  in  a larger  number  of  technical  areas,  regression 
equations  based  on  a composite  of  predictor  variables  including  the  four  Als,  AFQT  score,  educational 
level,  and  the  eight  non-verbal  tests  were  compared  to  the  equations  based  on  a combination  of  the  Selector 
Al  and  non-verbal  tests  alone. 

Results  of  these  regression  analyses  are  presented  in  Table  10,  Regression  Analysis  2.  A composite  of 
high  school  graduation  status,  AFQT  score,  and  Als  other  than  the  Selector  Al  increased  the  predictive 
efficiency  over  the  non-verbal/Selector  Al  composite  in  25  of  the  34  all-category  and  in  12  of  21  Category 
111  and  IV  courses. 

As  shown  in  Table  11,  the  multiple  correlation  composites  are  significant  in  all  but  one  course.  Food 
Service— Cook  (62230),  in  the  all-category  samples.  All  composites  remained  statistically  significant  upon 
cross-validation.  While  the  multiple  correlation  coefficients  were  statistically  significant  in  all  but  two  of  the 
lower  category  samples,  in  the  cross-validation  process,  two  additional  composites  failed  to  reach  statistical 
significance. 

In  this  age  of  high  speed  computers,  individual  classification  and  assignment  procedures  utilizing 
multiple  predictors  is  not  only  feasible  but  should  provide  a more  cost-effective  procedure  for  assigning 
enlisted  accessions  to  high  cost  training  programs. 

Development  of  Best  Composite  for  Individual  Specialties 

Although  the  potential  usefulness  of  additional  data  in  the  classification  and  assignment  system  has 
been  established,  an  optimal  composite  of  predictors  for  individual  specialties  was  explored.  If  certain 
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Table  9.  Multiple  Correlations  of  Non-Verbal/Selector  Al  Composite  and  Stability 
of  Composite  in  Cross-Application  of  Regression  Weights 
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i • 

I; 


I :•* 

1 -- 
#» 

y 

; £ 


Aral 

AFSC 

All  Catasorla, 

Categories  III  A IV  Only 

N 

All  NV  A 
Sel  AlC 

A*  A/B& 

N 

All  NV  A 
Set  Alc 

A*  A/B6 

Electronics 

30332 

99d 

.25 

X1’ 

_ 

30430 

138d 

.26 

- 

X 

— 

- 

30434 

137 

.37** 

.18* 

X 

— 

— 

32231 

145d 

.37** 

— 

X 

— 

— 

32830 

100d 

.26 

- 

X 

- 

- 

32833 

126d 

.53** 

— 

X 

— 

— 

32834 

99d 

.51** 

- 

X 

- 

- 

Mechanical 

42132 

132d 

.53** 

— 

98d 

.53** 

— 

42133 

225 

.43** 

.41** 

182 

.45** 

->9* 

43131 

1,147 

.36** 

.39** 

749 

24** 

23** 

43230 

371 

.55** 

.54** 

251 

42** 

.44** 

44330 

81 

.45* 

.43* 

1 14d 

.34 

— 

46130 

284 

.48** 

.28* 

193 

.38** 

.16* 

46230 

305 

.31** 

.30** 

194 

.27 

— 

53430 

98 

.58** 

.46** 

128d 

.38** 

- 

54330 

107 

.48** 

.42* 

80 

.28 

- 

Administrative 

60530 

1 2 1 d 

.39** 

— 

X 

- 

— 

60531 

175 

.49** 

.36** 

1 16 

.37* 

29** 

64530 

334 

.34** 

.25** 

204 

.30 

- 

70230 

362 

.39** 

.26** 

289 

.25* 

.24* 

73230 

106 

.40* 

.29** 

137d 

.27 

- 

Medical/Dental 

90010 

176 

.60** 

.49** 

89 

.56** 

.39** 

90230 

133 

.60** 

.52** 

123d 

.37* 

- 

90630 

1 16d 

.37* 

- 

X 

- 

- 

Services/Supply 

57130 

194 

.29* 

.29* 

132 

.38** 

.18* 

. 

62230 

1 12d 

.32 

— 

X 

— 

— 

63130 

152 

.61** 

.45** 

120 

.44** 

.53** 

64730 

114 

.38* 

.44** 

92 

.49** 

.18 

81130 

996 

.39** 

.32** 

714 

29** 

.35** 

81230 

300 

.52** 

.41** 

167 

.43** 

.30* 

92230 

IIId 

.30 

- 

X 

- 

- 

Communications/Ope  rations 

25231 

145d 

.43** 

- 

X 

- 

- 

27230 

195 

.47** 

.40** 

86 

.39 

— 

29130 

143d 

.46** 

- 

X 

~ 

— 

3,4  A**  corresponds  to  the  validation  sample. 

A/B"  corresponds  to  the  cross-application  of  regression  weights. 


mcludcs  all  eight  non-verbal  tests  and  Selector  Al. 

^Insufficient  N for  cross-application  of  regression  weights,  total  sample  used  where  applicable, 
insufficient  N for  Category  III  and  IV  analysis. 

•Significant  at  or  beyond  .05  level. 

••Significant  at  or  beyond  .01  level. 
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Table  11.  All  Variable  Composite  Correlations  and  Cross-Validation  Results 


All  Cat«9orl«i 


Category  III  A IV  Only 


Composite  Composite 

All  Vart.c  All  Vars.c 


Art* 

AFSC 

N 

A* 

A/Bb 

N 

A» 

A/Bb 

Electronics 

30332 

99d 

.64** 

X1' 

30430 

1 38d 

.59** 

— 

X 

— 

— 

30434 

137 

.46** 

39** 

X 

— 

— 

32231 

145d 

.45** 

— 

X 

~ 

— 

32830 

100d 

.51** 

— 

X 

— 

— 

32833 

1 26d 

.69** 

— 

X 

— 

— 

32834 

99d 

.68** 

- 

X 

- 

- 

Mechanical 

42132 

132d 

.68** 

_ 

98d 

.70** 

— 

42133 

225 

.48** 

.46** 

182 

.54** 

.33** 

43131 

1,147 

.46** 

.48** 

749 

.40** 

.35** 

43230 

371 

.64** 

.60** 

251 

.56** 

.60** 

44330 

81 

.50** 

.31* 

1 14d 

.42 

— 

46130 

284 

.52** 

.33** 

193 

.45** 

.21* 

46230 

305 

39** 

37** 

194 

.36** 

.32* 

53430 

98 

.63** 

.47** 

128d 

.49** 

— 

54330 

107 

38** 

.65** 

80 

.60** 

.46* 

Administrative 

60530 

1 2 ld 

.57** 

— 

X 

_ 

_ 

60531 

175 

.57** 

.44** 

116 

.49** 

.21* 

64530 

334 

.51** 

.42** 

204 

.49** 

.18* 

70230 

362 

.52** 

.41** 

289 

.40** 

.33** 

73230 

106 

.57** 

.48** 

- 137d 

.42 

- 

Medical/Dental 

90010 

176 

.66** 

.56** 

89 

.62** 

.27* 

90230 

133 

.64** 

.57** 

123d 

32** 

— 

90630 

1 16d 

.61** 

- 

X 

- 

- 

Services/Supply 

57130 

194 

.35** 

.35** 

132 

.44** 

.15 

62230 

1 12d 

.36 

— 

X 

— 

— 

63130 

152 

.67** 

.51** 

120 

.60** 

.46** 

64730 

114 

.41* 

.43** 

92 

33** 

.06 

81130 

996 

.45** 

.37** 

714 

.34** 

.38** 

81230 

300 

.61** 

.45** 

167 

30** 

.37** 

92230 

1 Ud 

.45* 

- 

X 

- 

- 

Communications/Operations 

25231 

145d 

39** 

— 

X 

— 

— 

27230 

195 

33** 

.42** 

86 

.55** 

.41** 

29130 

143d 

.59** 

— 

X 

- 

- 

a“A”  corresponds  to  the  validation  sample. 

“A/B”  corresponds  to  the  cross-application  of  regression  weights. 


clncludes  all  eight  non-verbal  subtests,  AFQT,  ASVAB  Als,  and  Education  level. 
^Insufficient  N for  cross-application  of  regress  weights,  total  sample  used  where  applicable. 


insufficient  N for  Category  III  and  IV  analysis. 


"Significant  at  or  beyond  .05  level. 


"^Significant  at  or  beyond  .01  level. 
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non-verbal  tests  or  data  items  do  not  add  significantly  to  the  prediction  of  the  training  criterion  in  a specific 
course,  there  is  no  reason  to  include  them  in  the  final  regression  equation.  This  phase  of  the  analysis  was 
designed  to  identify  the  most  economical  composite  from  the  standpoint  of  the  total  number  of  predictors 
required  in  the  final  equation  for  each  AFSC.  Only  those  courses  where  significant  multiple  correlations 
were  found  in  the  validation  sample  were  used  for  this  phase  of  the  analysis. 

Since  the  Als  are  derived  from  the  current  operational  test,  these  four  predictors  were  forced  into  the 
composite  since  no  additional  testing  and  no  revision  to  current  operational  tests  would  be  required  if  such 
a composite  were  used  in  an  operational  setting.  The  remaining  predictors  which  could  be  selected  for  the 
operational  composite  included  AFQT  score,  the  eight  non-verbal  tests,  and  educational  level.  The  optimal 
composite  combination  of  predictors  determined  by  regression  analysis  varied  widely  among  specialties  as 
shown  in  Tables  12  and  13.  In  the  Aircrew  Life  Support  (92230)  course,  the  optimal  composite  includes 
only  three  variables  in  addition  to  the  four  Als.  On  the  other  hand,  in  the  Air  Passenger  Specialist  Course 
(60530),  nine  additional  predictor  variables  comprised  the  best  composite.  In  the  lower  categories,  two  to 
nine  variables  in  addition  to  the  Als  were  required  to  develop  the  best  composite. 

It  is  obvious  that  a tailor-made  predictor  equation  for  each  specialty  is  feasible  and  could  be  used  to 
refine  and  improve  the  current  selection,  classification,  and  assignment  procedures  now  in  use. 

IV.  SUMMARY  AND  CONCLUSIONS 

This  study  was  designed  to  determine  the  validity  of  non-verbal  measures  in  the  prediction  of 
technical  training  outcomes  and  assess  the  utility  of  these  measures  as  possible  means  to  refine  and  improve 
the  Air  Force  operational  selection  and  classification  battery  currently  in  use. 

A non-verbal  battery  comprised  of  eight  subtests  was  administered  to  13,584  non-prior  service  airmen 
who  were  assigned  to  technical  training  in  34  different  AFSC  areas  after  completion  of  basic  training. 
Aptitudinal,  education,  and  FSG  criterion  data  were  obtained  from  AFHRL  research  records. 

Correlational  and  regression  analyses  were  used  to  evaluate  the  validity  of  the  individual  subtests  and 
composites  developed  from  the  non-verbal  measures,  aptitude  scores,  and  educational  data  and  assess  their 
usefulness  in  predicting  technical  training  FSG  over  and  above  the  current  operational  ASVAB  Selector  Al. 
Cross-validation  was  accomplished  where  sample  size  was  of  sufficient  size  to  permit  analysis.  In  addition  to 
analyses  based  on  the  total  sample  in  the  34  AFSCs,  separate  analyses  were  accomplished  on  subgroups  of 
lower  mental  ability  personnel  to  determine  the  usefulness  of  these  measures  among  those  individuals  who 
often  demonstrate  difficulty  with  the  traditionally  highly  verbal  test  measures. 

The  zero  order  correlations  between  the  individual  subtests  and  FSG  were  not  found  to  be 
statistically  significant  in  every  technical  course.  Figure  analogies  and  dial  reading  appear  to  demonstrate 
the  highest  relationship  with  FSG  in  a majority  of  technical  courses  in  both  the  total  sample  and  lower 
ability  subgroups.  Results  indicate  that  these  two  subtests  are  good  possibilities  for  additions  or 
replacements  for  verbal  subtests  in  future  revisions  of  ASVAB. 

The  multiple  correlations  based  on  a composite  of  all  eight  non-verbal  subtests  reached  statistical 
significance  in  over  75  percent  of  the  technical  areas  in  the  total  sample  and  lower  mental  category 
subgroups.  Although  some  of  the  subtests  did  not  reach  a level  of  statistical  significance  upon 
cross-validation,  overall  results  indicate  the  possible  utility  of  the  non-verbal  measures  is  not  limited  only  to 
personnel  in  the  lower  levels  of  mental  ability  but  might  be  beneficial  for  testing  all  categories  of  pereonnel. 

Analyses  to  determine  the  utility  of  the  non-verbal  composite  compared  to  the  Selector  AI  alone 
revealed  that  in  approximately  three-fourths  of  the  AFSC  areas,  the  non-verbal  tests  made  an  unique  and 
significant  contribution  to  the  prediction  of  FSG  over  and  above  the  Selector  AI  currently  used  in  the 
selection  and  classification  program.  Although  some  attenuation  of  the  observed  relationships  between  the 
composite  and  FSG  was  also  noted  upon  cross-validation,  it  appears  that  the  use  of  the  non-verbal  tests 
along  with  the  Selector  AI  improves  the  capability  to  select  and  assign  basic  airmen  to  specialty  areas  where 
they  are  most  likely  to  succeed.  When  additional  aptitudinal  and  educational  data  are  added  to  the  multiple 
composite  of  predictors,  the  predictive  efficiency  of  the  composite  increases  even  more. 
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Table  13.  Prediction  of  the  Best  Composite  by  Training  Course  (Category  111  & IV) 
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Even  though  the  use  of  a multi-predictor  system  for  selection  and  classification  (roses  no  technical 
problems  in  the  age  of  high-speed,  accurate  computers,  it  appears  more  economical  to  tailor  the  composite 
of  variables  to  each  specialty  area.  Tliis  approach  would  lend  maximum  tlexibility  in  the  use  of  additional 
test  and/or  educational  data  for  classification  purposes  yet  restrict  the  overall  number  of  variables  to  only 
those  which  make  a definite  contribution  to  the  prediction  system. 

These  findings  substantiate  the  validity  of  non-verbal  measures  and  the  potential  utility  of  including 
these  measures  in  future  operational  test  batteries.  Since  the  traditional,  highly  verbal  tests  have  been 
criticized  for  possible  discrimination  against  individuals  with  disadvantaged  backgrounds,  lower  mental 
ability,  and/or  reading  disabilities,  these  tests  should  not  only  help  to  estimate  the  “true  potential”  of  these 
disadvantaged  personnel  but  refine  and  improve  the  selection  and  classification  system  for  all  categories  of 
personnel. 
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